Random Matrices and Applications to Data Filtering

نویسندگان

  • QI WANG
  • Krishnamoorthy Sivakumar
  • Qi Wang
چکیده

by Qi Wang, M.S. Washington State University December 2003 Chair: Krishnamoorthy Sivakumar Preserving privacy is becoming an important issue in data mining. Random perturbation is a widely used technique to protect privacy of sensitive data values. This technique hides the true data records by modifying the data values using additive random noise, but can still estimate the data distribution from the perturbed data set. Our question in this thesis is: does this method really preserve privacy? Large random matrices have many properties. Random matrix theory has been widely used in nuclear physics, the study of the zeros of Reimann zeta function, the study of chaotic systems, and signal processing. After investigating some asymptotic properties of eigenvalues of covariance matrices, this thesis presents a random matrix-based data filtering technique, which we call spectral filtering. This method analyzes the eigenstates (spectrum) of the sample covariance matrix of observed data, and identifies the noisy eigenstates by applying the known asymptotic properties of random matrices. Experiments in the thesis show that this technique can produce good estimate of the true data for a reasonable value of the signal-to-noise ratio (SNR). This questions the effectiveness of additive random perturbation technique. Although spectral filtering technique has the ability to breach privacy in additive random perturbation, there exist other data mining techniques that can preserve privacy. One

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

On-Line Nonlinear Dynamic Data Reconciliation Using Extended Kalman Filtering: Application to a Distillation Column and a CSTR

Extended Kalman Filtering (EKF) is a nonlinear dynamic data reconciliation (NDDR) method. One of its main advantages is its suitability for on-line applications. This paper presents an on-line NDDR method using EKF. It is implemented for two case studies, temperature measurements of a distillation column and concentration measurements of a CSTR. In each time step, random numbers with zero m...

متن کامل

Globally Optimal Multisensor Distributed Random Parameter Matrices Kalman Filtering Fusion with Applications

This paper proposes a new distributed Kalman filtering fusion with random state transition and measurement matrices, i.e., random parameter matrices Kalman filtering. It is proved that under a mild condition the fused state estimate is equivalent to the centralized Kalman filtering using all sensor measurements; therefore, it achieves the best performance. More importantly, this result can be a...

متن کامل

Application of Single-Frequency Time-Space Filtering Technique for Seismic Ground Roll and Random Noise Attenuation

Time-frequency filtering is an acceptable technique for attenuating noise in 2-D (time-space) and 3-D (time-space-space) reflection seismic data. The common approach for this purpose is transforming each seismic signal from 1-D time domain to a 2-D time-frequency domain and then denoising the signal by a designed filter and finally transforming back the filtered signal to original time domain. ...

متن کامل

Applications of Random Parameter Matrices Kalman Filtering in Uncertain Observation and Multi-Model Systems

This paper considers the Linear Minimum Variance recursive state estimation for the linear discrete time dynamic system with random state transition and measurement matrices, i.e., random parameter matrices Kalman filtering. It is shown that such system can be converted to a linear dynamic system with deterministic parameter matrices but state-dependent process and measurement noises. It is pro...

متن کامل

On Random Additive Perturbation for Privacy Preserving Data Mining

Title of Thesis: On Random Additive Perturbation for Privacy Preserving Data Mining Author: Souptik Datta, Master of Science, 2004 Thesis directed by: Dr. Hillol Kargupta, Associate Professor Department of Computer Science and Electrical Engineering Privacy is becoming an increasingly important issue in many data mining applications. This has triggered the development of many privacy-preserving...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003